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Abstract 

We propose a new theoretical and practical framework for the study of light-matter interactions 
and the angular momentum of light. Our proposal is based on helicity, total angular momentum, 
and the use of symmetries. We compare the new framework to the current treatment, which is 
based on separately considering spin angular momentum and orbital angular momentum and using 
the transfer between the two in physical explanations. In our proposal, the fundamental problem 
of spin and orbital angular momentum separability is avoided, predictions are made based on the 
symmetries of the systems, and the practical application of the concepts is straightforward. Finally, 
the framework is used to show that the concept of spin to orbit transfer applied to focusing and 
scattering is masking two completely different physical phenomena related to the breaking of differ- 
ent fundamental symmetries: transverse translational symmetry in focusing and electromagnetic 
duality symmetry in scattering. 

PACS numbers: 42.50.Tx,11.30.-j 
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I. INTRODUCTION 



In the last decades, the angular momentum of light has received much attention from very 
diverse areas of Physics. From experimental astrophysics proposals for the detection of exotic 
cosmic objects [l[ to the use of light beams to rotate atoms [2j, through the exploitation 
of the infinite number of possible angular momentum values for increasing the capacity of 
optical communication networks j^] or developing new concepts in quantum information (J). 
The list of applications is long indeed j^]. 

The availability of appropriate theoretical tools for the study of light beams with angular 
momentum and their interactions with matter is crucial for the development of such a wide 
and promising range of applications. The current state-of-the-art theoretical framework is 
based on the separation of angular momentum J in its orbital (L) and spin (S) components: 
J = L + S. In the paraxial approximation the value of the total angular momentum along 
the optical axis can be split into a term which depends on the azimuthal spatial phase of the 
field and a term which depends on the polarization 6]. Several efforts have been undertaken 
to rigorously extend this approach to the non-paraxial regime, but have encountered funda- 



mental difficulties 



Strongly non-paraxial tightly focused light fields are the bread and 
butter of applications where light is made to interact with nano-structures, molecules and 
atoms. The mechanism of spin to orbit angular momentum conversion (SAM to OAM), also 
referred to as spin-orbit interaction, is the explanation of choice for numerically obtained 



observations in focusing |9Mll| and remarkable results in scattering experiments 12Ml4| . For 
example, the presence of optical vortices in tightly focused fields and in scattered fields is 
explained by conversions between the two types of angular momenta. 

Rigorously speaking, though, the separation between SAM and OAM cannot be made on 
firm physical grounds. Consequently, a conversion between the two quantities is not a fully 
satisfactory explanation for physical phenomena. The separate consideration of S and L is 
known to pose fundamental problems for the electromagnetic field {?], Q and its quantum 



excitations [15, §16], [16|, p. 50]. From the point of view of quantum field theory only the 
total angular momentum operator is a valid observable property of the photon. Even more 
generally, the strict non-separability is not restricted to photons. For example, it also applies 
to the electron since only in the non-relativistic limit can the orbital and spin parts of its 
angular momentum be separately considered jl^, §16]. The fundamental reason for such 



non-separability is the geometry of rotations of vectors and spinors. Such a restriction also 
applies to rotations of classical electromagnetic fields jl7l |. 

In this article we put forward an alternative theoretical framework for the general and 
rigorous treatment of the angular momentum of light and its role in light-matter interactions. 
Our approach solves the theoretical difficulties of the current framework, draws its predictive 
power from symmetry considerations, and can be simply applied in practice. By using it, 
we discover that the actual physical reason responsible for the presence of optical vortices 
in tightly focused beams is totally unrelated to the one responsible for the appearance of 
optical vortices in scattering experiments. In the current state-of-the-art framework, both 
cases are explained as SAM to OAM conversion. 



lie 



icity. The role of helicity (A) 



18] . The macroscopic Maxwell 



Our proposal is based on total angular momentum and 
in light matter interactions has recently been considered 
equations have been shown to be invariant under generalized electromagnetic duality trans- 
formations, and helicity has been identified as the generator of those transformations. By 
exploiting this connection, helicity can be used within the powerful formalism of symmetries 
and conserved quantities for the study of light matter interactions when the approximations 
implicit in the macroscopic Maxwell equations hold [19l chap. 6]. The use of symmetries 
and conserved quantities for the study of electromagnetic problems is the paradigm used in 
this article for the development of its theoretical concepts and their application to practical 
situations. 

In section Ull we outline the paradigm, mathematical concepts and notation used through- 
out the paper. In section II I II we summarize the different aspects involved in the separation 
of SAM and OAM. In section IIVI we outline our proposal. First, we discuss the concept 
of helicity and its associated symmetry and comment on a result from [3], which shows 
that, upon scattering, helicity transforms independently of the geometry of the scatterer. 
Then we show that the combined use of angular momentum and helicity solves the problems 
associated with the separation of SAM and OAM in a way that is simpler than the existing 
theoretical solutions and comment on the practical applicability of our ideas. Finally, we 
establish a relationship between helicity eigenstates and the transverse electric (TE) and 
transverse magnetic (TM) components of the field. Using this relationship we express the 
conservation law for helicity as a function of the partial scattering matrices in the TE-TM 
basis. This expression becomes useful in the practical application of the framework. In 



section [V] we use the developed ideas to revisit the concept of SAM to OAM conversion 
in focusing and scattering. We are able to clearly identify the underlying reasons for the 
presence of optical vortices in focused and scattered fields, which happen to be two totally 
different physical phenomena connected to the breaking of two independent fundamental 
symmetries: transverse translational symmetry in focusing and electromagnetic duality in 
scattering. Up to now, the two are explained by SAM to OAM conversion. During section 
\V\ we provide the analytical tools necessary for the practical application of our framework. 
Section [VI] contains our conclusions and discussion. 

II. PARADIGM, MATHEMATICAL SETTING AND NOTATION 

The paradigm that we follow in our work is the use of symmetries and conserved quantities 
for the study of electromagnetic problems. In this article we consider classical Maxwell fields. 
Symmetry operations like rotations and translations are linear transformations that apply 
to the fields. Similarly, we model the light-matter interactions as linear transformations of 
the free space fields. The fields themselves will hence always be transverse. These ideas are 
best formalized using the mathematical setting of Hilbert spaces. 

Therefore, in this article we make extensive use of the concepts and tools associated with 
a vector space endowed with an inner product, i.e. a Hilbert space, and the linear operators 
acting within that vector space. The vector space we consider is the space of transverse 
solutions of Maxwell's equations, or transverse Maxwell fields, which we call M. A linear 
operator within M takes one of its vectors, a transverse Maxwell field, and transforms it 
into another vector of the space, still a transverse solution of Maxwell's equations. We are 
interested in symmetry transformations of the fields: time translation, spatial translations 
and rotations, etc. These transformations are operators acting within M. Such continuous 
symmetries are generated by hermitian operators, also acting within M, which are asso- 
ciated with properties of the fields. For instance, energy generates time translations, the 
components of linear momentum generate spatial translations and the components of angu- 
lar momentum generate spatial rotations. A hermitian operator O generates a continuous 
symmetry transformation T{9) by means of 




(1) 



n 
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See [17|, §.21] for the detailed study of classical Maxwell fields using angular momentum 
and its generated transformation, spatial rotations. The fact that M has an inner product 
allows us to speak of hermitian operators. It also allows us to construct orthonormal basis 
into which any transverse Maxwell field can be expanded. The basis vectors can be chosen 
to be transverse fields which are simultaneous eigenvectors of four commuting hermitian 
operators. The choice of the inner product is not necessarily unique, even though the one 



we have used, defined in section HV CI has a long tradition for vector fields 20|, expr. 13.1.21]. 

The consideration of M, the hermitian operators associated with the fundamental quan- 
tities of the field and the transformations that these operators generate, allows to study 
electromagnetic problems using Maxwell fields together with the powerful framework of 
symmetries and conserved quantities. When the electromagnetic equations of a given sys- 
tem are invariant under the transformations generated by a given hermitian operator, the 
property of the field associated with that operator is a constant of the motion. Conversely, 
if the system does not possess that symmetry, we know that the associated property can, in 
general, change during evolution. The effect of the symmetry of the system is even stronger, 
as it must also preserve the eigenvectors and eigenvalues of the operator generating the 
symmetry, thus offering an adequate basis of vectors to solve the electromagnetic problem. 
As we will show, this is a simple yet insightful approach to electromagnetic problems. In 
this article, we want to exploit this approach for the study of the angular momentum of 
light. Consequently, for our purposes, we may only use properties of the electromagnetic 
field which are associated with a hermitian operator in M. Only then can we consider 
their associated symmetry. This rules out the separate use of the components of L and S, 
since their associated operators transform a transverse Maxwell field into a non-transverse 



field 21(, 16|, By. 2]: they do not act within the required vector space. From this point of 
view, L and S are qualitatively different from J, the linear momentum P, the energy H or, 
as we will discuss, helicity A. In the most commonly used representation, the expressions 
corresponding to these operators are 

H = ^ P = -<V, J = L + S, A=^, (2) 

L = —ir x V, S* m = -ie knm , (3) 

where S k , the k-th component of S, is a matrix of indexes nm defined using the totally 
antisymmetric tensor e^nm with ei 2 3 = 1. As discussed above, the operators in equation (j2J) 



transform a transverse Maxwell field into a transverse Maxwell field, while those in equation 
([3]) break the transversality of the fields. 

In our notation we use capital letters like J z and P x to denote operators, and lower case 
letters like j z (or n) and k x to denote their eigenvalues. When we speak of a field having a 
"sharp" or "well defined" value for an operator, we mean that the field is an eigenvector of 
that operator with eigenvalue equal to its "sharp" value. In the text, names like "helicity" 
or "third component of angular momentum" refer to the operators unless it is clear from 
the context that this is not the case. Also, all analytical calculations from section IIVCI on, 
assume a time harmonic decomposition of the fields with an exp(— iwt) dependence. 

Finally, we would like to mention that the main context of this work is that of classical 
Maxwell fields. Nevertheless, the approach here taken, which is based on the study of 
symmetries, is general and often used in Quantum Mechanics and other areas of modern 
Physics. 



III. THE SEPARATION OF L AND S 



Serious theoretical difficulties are encountered when attempting to separately consider L 

n n n n 

and S for the electromagnetic fields (|7[, |8[) or its quanta (|15|, §16],[16j, B/.2]). We also 



know 



2l|, 



lq . B/.2] that the operators L and S break the transversality of the fields, taking 



a vector of M out of that space. As far as we are concerned, this prevents the separate use of 
L and S for studying electromagnetic problems using symmetries and conserved quantities: 
L and S do not generate symmetry transformations for vectors in M. 

On the other hand, the total integrated value of the angular momentum of the electro- 
magnetic field, expressed here in convenient units as an integral over all space involving the 
electric and magnetic fields E(r) and B(r) 



(J) 



dr r x (E(r) x B(r)), 



(4) 



can be separated into two gauge invariant integrals. To achieve such separation one needs 
to consider only the transverse parts of the field, which are always gauge independent. This 
restriction is justifiable because the degrees of freedom associated with the longitudinal part 
of the electric field can always be combined with the degrees of freedom of the sources 
I.B.5],[2a chap. XXI,§22]. Then, in 
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(J t ) = J dr r x (E t (r) x B(r)) (5) 

3 

= [dv (Et(r) x A t (r)) + [ dv £^(r)(r x V)^(r) = (S t ) + (L t ). (6) 

where index z refers to the three spatial components, the two terms (St) and (L t ) are 
apparently gauge invariant. From now on, we will drop the t underscripts since in this article 
we only consider transverse Maxwell fields. The identification of the two parts of equation 
with spin and orbital angular momenta is tempting due to the appearance of the operator 
L = -ir x V and the relationship of the cross-product in (S t ) with the spin-1 matrices 
representing S. But, since L and S are not operators in M, the question arises of which 
are the operators corresponding to the two parts of equation ([6]). In his book on Quantum 
Mechanics, Messiah 22|, Ch. XXI, problem 7] offers an expression which corresponds to 
;he second quantization of the first part of equation In 1994, Van Enk and Nienhuis 



2l| , in a more detailed study, arrived at the same result and also derived the expression for 
the second part of equation ([6]). In that work, they showed that the two operators are not 
angular momenta because they do not satisfy the commutation relations which define angular 
momentum operators. These Fock space operators have their corresponding operators in M 



for classical fields (see section HVBI) . and their third components commute 23| . It is hence 



possible to split the total angular momentum J into two operators S = A-ipr and L = J — S. 
Consequently (L t ) and (S t ) in (JSJ) are in reality (L) and (S). Unfortunately, in terms of 
the separation of the angular momentum operator, this approach is not fully satisfactory 
because the resulting operators are not angular momentum operators. This can be easily 
proved by checking that their vectorial components do not fulfill the commutation relations 
required for angular momentum operators. The consequence of this is that they cannot 
separately generate meaningful rotations. In summary, J can always be decomposed into 
two meaningful operators, J = L + S, independently of the paraxial approximation. These 
two operators are never angular momenta since they never fulfill the correct commutation 
relationships. This last statement is also independent of the paraxial approximation. See 
section IIVBI for a more detailed discussion. 

Following the paradigm discussed in section [TXJ, we only consider properties of the field 
whose corresponding operators generate transformations in M. As already mentioned several 



times, this disqualifies L and S. As substitutes, one may choose L and S. We prefer to 
disregard the question of the separation completely and use the total angular momentum 
J and the helicity instead. Our choice is based on the fact shown below (section IIVBI) 
that J and helicity generate very simple symmetry transformations which lead to a simpler 
framework. 

Finally, it is worth noting that expressions (j3J) and (JSJ) can be interpreted and computed as 
a weighted average. When the electromagnetic field is decomposed in a basis of eigenvectors 
of J z , the total integrated value (J z ) is equal to a weighted average [19j, chap. 9.8, expr. 
9.143]. The weights are the squares of the amplitudes in the linear decomposition of the field, 
and they multiply the different eigenvalues j z of each mode in the basis. This connection 
is not restricted to angular momentum. All the integrals that are used to compute total 
integrated values of properties of the electromagnetic field can be interpreted and computed 
as weighted averages involving the squares of the expansion coefficients and the eigenvalues 
of the operator related to each particular property. This connection is the key step used for 
obtaining a Fock space operator from the classical spatial integral involving the fields. See 
for instance [2J, chap. 10.2.3, 10.5] for the cases of H and P respectively This connection 
relates the algebraic formalism introduced in sec. |TT]with the well known spatial integrals 
involving the fields, like (jlj) for the case of angular momentum. 



IV. A FRAMEWORK BASED ON HELICITY AND ANGULAR MOMENTUM 



A. The helicity of light fields and its associated symmetry 



Helicity is defined as the operator which results from projecting the total angular mo- 



mentum onto the linear momentum 



25 



chap. 8.4.1], i.e. A = J ■ P/|P|. Helicity commutes 



with all the generators of rotations J and translations P 25|, chap. 10.4.3]. In the case of 



the photon, the helicity can only take the values ±1 [26|, chap. 2.5]. For the electromagnetic 
field, A has only two eigenvalues equal to ±1. A useful interpretation of helicity is obtained 
by considering the expansion of the field as a superposition of plane waves. In such a rep- 
resentation, helicity is associated with the handedness of each plane wave. For a particle to 
have a well defined helicity, all the plane waves must be purely circularly polarized and have 
the same handedness with respect to its momentum vector. This is illustrated in Fig. [TJ 
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A = 1 



A = -1 



A undetermined 




FIG. 1. (Color online) A field composed by the superposition of five plane waves has definite 
helicity equal to one if all the plane waves have left handedness (left part), equal to minus one if 
they all have right handedness (central part) and does not have a definite helicity if all the plane 
waves do not have the same handedness (right part). 

Given that a general light beam can always be expanded as a sum of plane waves and that 
helicity is associated with the polarization handedness of those plane waves, helicity seems a 
very suitable candidate for representing the polarization degrees of freedom of a general light 
beam. Crucially, the action of helicity on a plane wave affects only its polarization vector: 
it leaves the plane wave momentum vector invariant. This is also true for other types of 
light beams like multipolar fields or Bessel beams: the action of helicity does not change the 
quantities related to energy, linear or angular momenta which define those beams. Helicity 
acts on separated degrees of freedom, which, in this article, we refer to as the degrees of 
freedom of the polarization. 

From its definition, it is readily checked that helicity is a hermitian operator. With respect 
to related transformations, helicity is the generator of the generalized electromagnetic duality 
transformation of the fields: 

E ->■ cos(#)E - sin(0)H, 

(7) 

H -> sin(#)E + cos(#)H, 
where 9 is a real angle. That helicity generates duality is a remarkable fact which can hardly 
be deduced from the mathematical definition of helicity containing the angular and linear 
momenta. 
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Generalized duality (J7|) is well known to be a symmetry of the source free microscopic 



Maxwell equations. The results in 



18] allow to extend the relationship between helicity 



and duality to the macroscopic Maxwell equations in material systems. That work shows 
that the helicity of the light interacting with a piecewise homogeneous isotropic medium will 
transform independently of the shapes of the different material domains, and that helicity 
will be preserved by the interaction if and only if the ratio between the relative electric and 
magnetic constants of all the involved materials i E [1 . . . N] is constant, that is: 

- = «V«. (8) 

When condition (jSJ) is met, the macroscopic Maxwell's equations for the system are 
invariant under transformation ([7]). 

The geometry independent character of the duality symmetry (helicity conservation) 
allows to separately consider the transformations of the polarization degrees of freedom 
from the geometry of the scattering problem. Such notable simplification is very desirable 
in a framework for the study of light matter interactions. 

We propose the use of helicity, the generator of duality transformations, for treating the 
polarization degrees of freedom in electromagnetic problems. We will show that, when used 
in conjunction with the total angular momentum, the generator of rotations, we obtain a 
general framework for the study of problems involving the angular momentum of light and 
its interaction with matter using the language of symmetries and conserved quantities. It is 
important to recall that, in this article, we will remain within the approximations implicit 
in the macroscopic Maxwell's equations [ji], chap. 6]. 

For completeness, a expression of the total integrated value of helicity in terms of elec- 
tromagnetic fields can be found in j^]. In this article, though, we are not concerned with 
the total integrated value of helicity. Instead, we use modes with well defined helicity and 
consider their transformations in particular situations, which will depend on whether the 
system is invariant under electromagnetic duality. 



B. Theoretical and practical considerations when using A and J z 



The fundamental problem of the separability of S z and L z is avoided by using A and 
J z instead. Both J z and A are commuting hermitian operators in M, and generate two 
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simple and independent symmetry transformations of the electromagnetic field: J z generates 
rotations around the z axis and A generates the generalized duality transformation ([7j). 



As previously mentioned, the two 
tization of expression (jSJ) studied in 



bck space operators obtained from the second quan- 



operators for classical fields: in 



21] (and also later in [28]) have their corresponding 



23] they are obtained in the momentum space. For instance, 



it can be seen that S z , the operator substituting S z is AP 2 /|P|. This operator commutes 



with L 7 



23 



28] . whose expression is obviously L z = J z — AP 2 /|P|. Consequently, we could 
use S z and L z instead of J z and A. Considering the symmetries generated by each pair of 
operators we prefer to choose J z and A for reasons of simplicity. While J z and A are related 
to the two simple symmetries indicated above, the symmetries related with S z and L z are 
more complicated. An explicit mathematical expression for the transformation generated 



by S can be found in [29|]. For once, S z involves a combination of duality and translational 
symmetries, while the symmetry related to L z has not been properly studied as such, up to 
our knowledge. This suggests that using J z and A should, in most situations, result in a 
simpler analysis. For instance, it is explicitly seen in section IV Bl that the conservation law 
associated with AP 2 /|P| is not independent of the geometry of the problem. 

In the paraxial limit, when P z — > P, it can be shown that S z — > A and L z — > J z — 
A. Even in this limit, neither of these operators generates physical rotations for the full 
electromagnetic field. For a paraxial beam, the helicity may be approximated by the real 
space circular polarization component perpendicular to the z axis and, the value of J z — 
A coincides (see equations f|T7|) - f|T8|) below) with the azimuthal phase winding number of 
the dominant circular polarization. It is customary to use the paraxial correspondence 
between SAM and circular polarization and OAM and azimuthal phase. From the previous 
considerations, we think that it would be more insightful to use A and J z — A, which have 
the advantage of retaining its meaning outside the paraxial approximation. 

From the experimental point of view, section IV CI shows that the preparation of general 
(non-paraxial) beams with well defined helicity can be done using simple optics in a straight- 
forward fashion. The measurement of the helicity state of a general beam is also shown to 
be easily achieved using simple optics. 

In this article we focus in the combined use of A and J z . However, it is worth mentioning 
that the fact that A commutes with all the generators of rotations J and translations P, 
and that it generates a transformation which is independent of geometry, should allow its 
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combined use with other degrees of freedom different from angular momentum when the 
particular problem requires it. 



C. Helicity preservation conditions on the TE-TM scattering matrices 

In this section we arrive at a relationship between helicity conservation and the scattering 
of the transverse electric and transverse magnetic components of the field. This relationship 
is useful in practical problems, as will be seen in section |V] 

Let us consider the general scattering problem presented in figure (T5]). An incident elec- 
tromagnetic field E ira impinges onto a scatterer of arbitrary shape S resulting in the scat- 
tered field E sc . Consider an orthonormal basis of transverse electromagnetic modes with 
well defined helicity {A+,A~} V u, where the superindex denotes the sign of A and the 
subindex v is a composed index which contains three other commuting operators. For ex- 
ample v = [H,P x ,P y ] for plane waves, with H being the energy, and P x , P y the first two 
components of linear momentum. 



Both incident and scattered fields can be expanded using the {A^, A^} basis, and the 
scattering process is fully characterized by the following transformation of the incident field: 




E; 



in 



Ecr — S E; 



in 



FIG. 2. (Color online) Scattering problem of arbitrary geometry 



E. 



'sc 



SE;. 




dv \ dvSZtA+Af + Sll+ApAf + S^+A+Af + S*'~A^A~ E, 



(9) 



E. 



'sc 
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where At is the hermitian conjugate of A. In fl9]), the transformation S is represented by a 
doubly infinite weighted sum of projection operators of the type AA^, whose action on the 
input field is (AAt)E sc = A(A"l"E in ) and the inner product 

A f E in = ( drA(r) f E iri (r), (10) 



is assumed. 

Transformation is specified by the infinite set of 2 x 2 partial scattering matrices 
relating the coefficients of E in in A+, A~ to the coefficients of E sc in Ap~ , Ap for all v, v: 

(qv,+ C^)+\ 
V V - (11) 

Let us impose helicity conservation upon scattering, forcing all the partial scattering 
matrices to be diagonal: S"'_ = S"'+ = for all v, v. 

Consider the following linear combinations of modes with well defined helicity. 

M, = -L ( A + + A;) , N„ = (A+ - A;) , (12) 

which transform into each other by the action of A 

AM, = N„, AN, = M„. (13) 

In appendix El we show that, for plane waves, Bessel beams and multipoles, the sum 
and subtraction of modes with same v and different helicity result in what are commonly 
known as TE and TM modes. We will adopt the TE-TM naming to denote general sum and 
subtractions of modes differing only by their sharp helicity eigenvalue. 

Since {A+,A~} is an orthonormal basis, so is {M„,N V }. After changing (jHJ) to the 
{M u , N„} basis, the condition for helicity preservation upon scattering, expressed in the 
TE-TM basis reads: 



qU,TE qU,TM 
qU,TE qU,TM 




v,— 



si 



qV,- 



qV,+ , qp 



(I 



... K 

hi al 



(14) 
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for all {y, v). Namely, that the scattering of TE and TM components is on an equal footing 
for all (z/, u), as can be seen from the scattering matrix having the same values a v v in the 
diagonal and b v v in the off-diagonal. Condition ( |14p is clearly a restriction which will not be 
met in general. We conclude that, in general, a scatterer will partially convert the helicity 
of the incident field and that this partial helicity conversion is reflected in asymmetries of 
the scattering matrices with respect to the TE and TM modes. 

Since helicity conservation is equivalent to invariance under generalized duality transfor- 
mations, having information on the TE-TM scattering properties of a system can be used 
to assess its duality invariance and vice versa. 

In systems with a high degree of symmetry, like a sphere or a planar multilayer system, a 
wise choice of basis simplifies condition (|14p . The symmetries of those two systems make the 
TE and TM multipoles |l9l chap. 9] and the TE and TM plane waves (definition contained 
in appendix |B]) the eigenmodes of the spherical and planar structures respectively. This 
means that S v v = unless v = v and that S^ E = S^m — for all u, v. All the partial 
scattering matrices are diagonal. The preservation of the TE-TM components is related 
to the geometrical symmetries of the system. Using Mie scattering theory and Fresnel's 
formulas, it is an easy and interesting exercise to analytically verify that, in these two cases, 
when the materials meet ([8]), all the scattering matrices are proportional to the identity 
(^v'te = ^uTM f° r a ^ Z/ )' hence preserving helicity as well. 



V. EXEMPLARY APPLICATION OF THE FRAMEWORK: REVISION OF SPIN 
TO ORBIT ANGULAR MOMENTUM CONVERSION IN FOCUSING AND SCAT- 
TERING 

The conversion between spin and orbital angular momentum is widely used to explain 
phase singularities in numerical simulations of tightly focused fields js il], and in scattering 
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141 ] . A detailed discussion of the SAM to OAM conversion can be found 



experiments: 
in Q. 

We will now use symmetries and conserved quantities arguments, particularly those re- 
lated to J z and A, to identify the actual physical reasons for those observations. We will 
demonstrate that the mechanism responsible for the presence of optical vortices in focused 
fields is totally different from the one responsible for the observation of optical vortices in 
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scattering experiments. This, in our opinion, shows that the SAM-OAM formulation can be 
quite misleading: it assigns the same explanation to two distinct physical phenomena. 

The analytical tools and methodology employed in this section allow a simple application 
of the ideas developed in the previous section to practical electromagnetic problems. 



A. Bessel beams with well defined angular momentum and helicity 

As already mentioned, J z and A commute. For our analysis we will need a basis of 
transverse electromagnetic modes which are simultaneous eigenvectors of J z and A. One 
such set of modes is a particular type of Bessel beams. Bessel beams have been thoroughly 



studied. See for instance the series of papers [28], [3l(, and [321 ] . 



In appendix [Bj we constructively derive a complete orthonormal basis for transverse elec- 



tromagnetic fields consisting of Bessel beams with wel 
momentum J z and helicity A. These modes appear in 



defined third component of angular 



2l|, although their relationship with 



helicity is not considered in that paper. In [28| , these type of electromagnetic modes are ob- 
tained in a different way as linear combinations of other type of more commonly used Bessel 
beams, the transverse electric (TE) and transverse magnetic (TM) modes. The constructive 
derivation in appendix |B] shows clearly that the modes are eigenstates of A. 

From appendix (Bj equations ( fT5l) are the real space expressions in cylindrical coordinates 
[p, 9,z\. An implicitly harmonic exp(— iwt) dependence has been assumed. 



C nkp (p,e,z) = \/ ^-i"exp(i(fc z z + n0)) ^ — ((1 + -^)(J n+1 (fc p p)exp(j0)r + (1 - -^)J n _ 1 (fc p p) exp(-i0)ij - -^J n (fc p p)z 



T> nkp (p,d,z) = ^ ^-i n cxp(i(k z z + ne)) ^— ^(1 - -^)(J n+1 (fc p p)exp(j0)r + (1 + y)J n -i(fc p p)exp(-i6>)lj + -^J„(fc p p)z 

(15) 

where: 



_ U . Ih.2 T p 1.2 _ h.2 , h.2 

i _ x+iy - _ x-ty 
1 ~ V2 ' F ~ V2 ■ 

By construction, the two types of vector wave functions C n k p and D n ^ have a sharp 
value of the z component of angular momentum J z equal to n and a sharp value of he- 
licity A equal to —1 and +1, respectively. Additionally, they have well defined values for 
the energy H and the z component of the linear momentum P z proportional to k and k z 
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respectively. For a given value of k, a well defined value of P z implies a well defined value of 
the transverse momentum P p proportional to k p . Modes f fl5|) form a complete orthonormal 
basis of transverse electromagnetic fields when 

ke[0, oo), n G [0,±1,±2, ...], k p G [0, oo), and A = ±1, (16) 

and both signs of k z in k z = ±\Jk 2 — k 2 p are considered. In the following, the consideration 
of both signs of k z is implicitly made. 



B. Optical vortices in focusing 

In order to study why optical vortices seem to appear in numerical studies of focalization 
of apparently vortex free beams 9Ml lj . we analyze the aplanatic lens model 33|, which is 
routinely used to study the effects of high numerical aperture lenses. 

As we show in appendix O and has been discussed before [3Q| , the action of an aplanatic 
lens preserves J z and A. The cylindrical symmetry of the model can be reasonably expected, 
but its invariance under duality transformations is "hidden" in the assumption that the lens 
transmission coefficients for the two polarization components, TE and TM, are identical and 
that there is no crosstalk between input and output TE and TM components. That this 
assumption implies duality symmetry (helicity conservation) is obvious from condition f fl^|) 
and the discussion at the end of section HVCl The preservation of A by an aplanatic lens has 



been discussed in 30] without using its relationship to electromagnetic duality. In appendix 
Owe explicitly analyze the conservation of J z , highlight the model's key assumption on TE 
and TM scattering, and show that the model conditions for the conservation of A and J z 
are , as expected, independent of each other. 

Even though both cylindrical and generalized duality symmetries are preserved by the 
model, the focalized beam is quite different from the input beam: some other symmetry 
must be broken. The most obvious candidate is the lack of translational symmetry on the 
plane perpendicular to the optical axis of the lens. We know that the transverse momentum 
components P x and P y are the generators of that symmetry. Below we show that the 
differences between the input and focalized beams are due to changes in (k x ,k y ). We will 
prove this point using the basis introduced in (fl5l) . 

Let us take a collimated right circularly (RC) polarized Gaussian beam going through an 
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aplanatic lens of high numerical aperture. The linear momentum components of a collimated 
beam are all almost totally aligned with the propagation direction z: k z ~ k. Consequently, 
a collimated beam only has components with small transverse momentum value k p . With 
respect to f[T51) . in the limit when k z « k (% — > 0), both C n fc p and D„(; approach pure RC 
and LC polarized modes respectively. This can be easily seen by setting a -£ — > =>- (k z ~ k) 
in dH]): 



C nfcp (p,0,z) « y^ n+1 exp(^(M)^ + i(M ex P(^(^ + 1 ))r, (17) 
D nfcp (p,M « y£z n+1 exp(^(A; z z))J„_ 1 (A; p p)exp(^(r2 - 1))1. (18) 

The other polarization components, the opposite circular and the longitudinal z component, 
are strongly attenuated in this regime. Importantly though, they are actually present: 
without them, the modes are not solutions of Maxwell's equations, and its transformation 
properties cannot be consistently analyzed as general electromagnetic fields. 

From fl!7p . and since the collimated input RC Gaussian beam does not have a phase 
singularity in its f dominant polarization, we can conclude that mostly C type modes with 
n = — 1 will exist in its expansion in the ( fl5l) basis: 



E 



input 



poo 

/ dk/X^nk P , (19) 
JO 



where c^f^* is only significant when — >• 0. Equation (Tl9l) defines a beam with a sharp 
value of J z , n = —1, and a sharp value of A, A = —1. As per the above discussion regarding 
symmetries, the output beam must also have sharp J z and A values of n = —1 and A = — 1. 
Focusing can hence only change the relative weight of k p components. Intuitively, modes 
with higher transverse momentum are needed to expand the field after focusing. 



E /oc = / dk p c f _ l k C nkp , (20) 

which is in line the non-preservation of (k x , k y ) due to broken transverse translational sym- 
metry. The fact that the change is limited to k p = ^Jk 2 x + ky could have been foreseen: it 
stems from the cylindrical symmetry of the model. 

Now, let's go back to equation ffT5]) and check the spatial shape of C n k p modes when 
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n = —1 and the condition -A — > is not met: 

k 



C nkp {p,0,z) = - 
+ 




i k 



)Jo(k p p)r 



(21) 



]: ,J-%{k p p) exp(-i20)l - -^J-i(k p p) exp( 



The C n fc p , — >■ modes are almost purely right polarized, but when k p increases, the 
terms multiplying 1 and z become significant. As it can be seen in ( 12T1) . for n = — 1 
these newly enhanced terms have phase singularities of orders minus two and minus one 
respectively. Fig. [3] shows the radial spatial profiles of the three polarization components 



for two C-i^ (p, 9, z) modes, one with 



0.1 and the other with -A 



0.9. In the small 



k p case (Fig. 3(a) ), the dominant polarization component f is much stronger than the 1 and 



z components (which are nonetheless present). In the large k p case (Fig. 3(b)), the relative 
weight between the intensity of the different polarizations has shifted significantly. The 
vortices of charge —1 in z and charge —2 in 1 (see ( 1211) ) become relatively more important. 
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FIG. 3. (Color online) Normalized field intensity for the right, left and longitudinal polarization 
components for two C n k p , n = —1 modes, one with -A = 0.1 (left figure) and the other with 

k k 

-A = 0.9. Note the scaling of the non-dominant polarization components on the -A = 0.1 case. 

We argue that these are the optical vortices appearing in numerical simulations of focused 
beams, and that the correct explanation is not SAM to OAM conversion but just the inherent 
spatial properties of light modes with definite energy, P z , J z and A propagating through a 

18 



system that conserves energy, A and J z while breaking transverse translational invariance. 
The lens shifts the weight distribution towards modes with larger k p values and optical 
vortices already existing in the initially strongly attenuated polarization components of the 
input beam gain relative importance in the focalized beam. For the theoretical study of 
optical vortices in focused beams, [9| and [30] use solutions of the paraxial equation as the 
input modes, instead of using solutions of the full Maxwell equations as we have done. Since 
paraxial input modes do not have the attenuated phase singularities in the other polarization 
components because only a single polarization component is non zero, the appearance of 
optical vortices upon focusing was, contrary to this paper's explanation, attributed to SAM 
to OAM conversion. 

It is interesting to note that the property associated with AP Z /|P| is not preserved upon 
focusing. Helicity is conserved but, due to the lack of of translational symmetry in the 
transverse plane, P p changes, which implies that P z changes. The fact that the lack of a 
"geometrical" symmetry breaks the conservation law corresponding to AP 2 /|P| indicates 
that the symmetry transformation generated by such "spin operator" is, in general, not 
independent of the geometry of the problem. 



C. Optical vortices in scattering 

The experimental observation of optical vortices in scattered fields has been reported in 



the literature 
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■14j. In these papers, the observations are explained by means of spin to 
orbit angular momentum conversion durin g th e interaction with the scatterer. Recently, 
similar observations have been analyzed in 18| using symmetries and conserved quantities, 
and reaching a very different conclusion. In line with the discussion of the last paper, we 
will show in this section that the reason for all these experimental observations is not SAM 
to OAM transfer, but a partial helicity change during the light-matter interaction due to 
the breaking of electromagnetic duality symmetry in the system. We will also argue that 
the helicity change is enhanced by physical effects which strongly break duality, beyond the 
inherent duality asymmetry of general planar multilayer structures. 

We have already mentioned several times that helicity transforms independently of the 
geometry of the scatterer. In particular, it transforms independently of whether the con- 
sidered system has cylindrical symmetry or not. Nevertheless, as seen below, a change in 
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helicity is very clearly identifiable in the spatial patterns of the scattered fields when the 
system has cylindrical symmetry. Several of the experimental setups and input beams in 
the articles cited in this section have cylindrical symmetry and are similar to the system in 
Fig. HI which we will now analyze. 



Preparation Cylindrical symmetry Measurement 



LASER 



LP QWP| 



Lens 



Lens 



QWP LP 



CCD 



FIG. 4. Archetypal experimental setup. A collimated beam is circularly polarized by means of a 
linear polarizer (LP) and a quarter wave plate (QWP). After focusing, the beam interacts with 
a cylindrically symmetric target, in this example, a circular nano-aperture in a metallic film on a 
substrate. A portion of the scattered field is collected and collimated by a second lens, analyzed by 
a second set of QWP an LP, and its spatial profile is imaged into a Charged Couple Device (CCD) 
camera. The two orthogonal settings of the last LP allow the projection of the two collimated 
helicity modes (see the text for details). 

In Fig. HI we distinguish the preparation and measurement apparatus from the focusing 
and scattering part of the setup. In the preparation stage, a collimated Gaussian beam goes 
through a linear polarizer (LP) and quarter wave plate (QWP), which results in a beam 
with well defined values of J z and A (see more details below). In the central part, the beam 
is focused onto a cylindrically symmetric target and the resulting scattered light is collected 
by another lens. The measurement part of the setup uses another QWP and LP to project 
light with different helicity depending on the setting of the LP (see more details below). 

At first sight, the central part is cylindrically symmetric but lacks translational symmetry 
in the transverse plane. We should hence expect conservation of J z and, as in section ( jVBl) . 
non-conservation of P p . Since the lenses preserve helicity, the behavior of the system with 
respect to A depends on whether the target sample meets condition ([8]). Let us assume that 
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it does not meet such condition. 

Let us imagine that, after the first LP and QWP, the input to the first lens is a pre- 
dominantly RC polarized Gaussian beam. Assuming perfect optical elements, and using the 
same arguments as in section IV Bl we can see that the input beam can be expanded using 
basis (II 5p into modes with — > and sharp values of J z , n = —1, and of A, A = —1. The 
first lens will focus the beam into the sample without altering either J z or A, then, since it 
is assumed that the sample does not meet ([H]), part of the light will change helicity upon 
interacting with the cylindrically symmetric target and, after collimation with the second 
lens, there will be two types of collimated modes, both with n — — 1, but differing in the 
value of helicity. Schematically representing the actions of the lenses and the cylindrically 
symmetric scatterer as transformations of fields expanded in the basis ffT5]) by coordinates 
c n k p and d n k p , we may summarize the whole sequence as: 

Lens Scattering , , , Lens / , \ 

c _i^o >C -^ P > (c-i,fc p ,d-i,fcj ► fc ^ d J. (22) 

' k \ 3 k 1 k / 

The corresponding modal expressions for the components of the two types of output 
modes are: 

Ci!±^n(P'9' z ) ~ \ —i 2 exp(ik z z)J (k p p)r. 

k Vtt_ (23) 

[k 

D_ 1 fc £ ^ (p, 0, z) ~ W -^-i 2 exp(ik z z)J- 2 (k p p) exp(—i28)\. 

That is, a predominantly RC beam similar to the input, without any phase singularity 
in f and a predominantly LC vortex beam with a singularity of charge two in 1. If we set 
the last LP to project the LC component of the output collimated beam, this vortex will 
appear in the CCD camera. 

We argue thattee are ^vertices observed in the cylindrically symmetric scattering 

Figs. 2(c)-(d)] and [14j . and that the underlying reason 



experiments of 12J, Fig. 4] 
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for their appearance is not SAM to OAM transfer, but that the electromagnetic duality 
symmetry is broken in those systems. 
The samples used in [3] and 



13] consist of nano-apertures on metallic thin films. The 
breaking of duality in a general planar multilayer structure, and in particular when it includes 
a metal, is clear since the relative electric and magnetic constants of the system will not 



meet condition (JSJ). Nevertheless, as shown in 

21 



181 ] . the effect due to the multilayer alone 



is typically small in terms of helicity conversion. The reason why the helicity conversion is 
enhanced making it easily detectable, is that the nano-apertures allow for light to couple to 
the natural modes of the multilayer system, where duality is strongly broken. In a multilayer 
system, the natural modes are either TE or TM resonances. Since a mode with well defined 
helicity has equal contributions from TE and TM components (IT2"|) . a TE only or TM only 
resonance implies a strong breaking of electromagnetic duality. The interfaces between a 
metal and a dielectric allow the existence of surface plasmon polaritons of TM only character. 
As shown in [18j, the influence of these modes in the transmitted light is responsible for the 
experimentally detectable helicity change. 

In [ijj], optical vortices are observed upon propagation of light through a planar semi- 
conductor microcavity, still a cylindrically symmetric system as noted in that work. In this 
case, duality is strongly broken in the multilayer itself by the energy splitting between TE 
and TM modes in the structure. This splitting is identified in that work as the enabler for 
SAM to OAM conversion. 

References 12| and 13] contain also results for non-cylindrically symmetric setups. Even 
though their study using J z is not as simple, we are confident that the generality of the 
methodology that we propose can rid their analysis from the artificial concept of SAM to 
OAM conversion, possibly leading to further physical insights. For instance, for the squared 
nano-apertures studied in Q, the same argumentation about the nano-aperture mediated 
coupling onto strong duality breaking multilayer natural modes applies. We postulate that 
any instance where SAM to OAM conversion is invoked can be properly explained using our 
framework. 



VI. CONCLUSIONS AND DISCUSSION 

Using the helicity of light fields to represent the polarization degrees of freedom, we have 
introduced a general framework for the study of light beams with angular momentum and 
their interactions with matter. In particular, the framework does not depend on the appli- 
cability of the paraxial approximation. Our proposal is based on two hermitian operators 
in the Hilbert space of transverse electromagnetic fields: the helicity A and a component 
of angular momentum J z . These operators are the generators of simple transformations of 
the fields: generalized electromagnetic duality and rotations along the z axis respectively. 
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Since the generalized duality transformation is independent from rotations or translations 



181 ] . the use of A allows to consider the changes in polarization independently from other 
degrees of freedom like angular momenta (J) and linear momenta (P). This simplification is 
crucial for the practical applicability of the framework. We recall that the approximations 
implicit in the macroscopic Maxwell's equations are assumed in this paper. 

We propose this new framework as a substitute of the current state of the art treatment, 
which is based on the separation of spin and orbital angular momenta for the description of 
the angular momentum of light, and on the mechanism of SAM to OAM transfer in light- 
matter interactions and other situations. We have shown that it avoids the fundamental 
problems associated with the separation of SAM and OAM in a simpler fashion than the 
current theoretical solutions. We have also demonstrated its predictive power based on 
analyzing the broken and unbroken symmetries of the system to predict which properties 
of the light will change and which ones will be preserved. The current framework lacks 
predictive power and, at this point, can only be qualified as a descriptive theory. This 
is illustrated by the fact that, using our methodology, we have shown that, phenomena 
commonly explained as SAM to OAM transfer in focusing and scattering are actually due 
to two completely independent physical reasons, showing the inconsistency of the current 
framework. The results of this paper suggest that such inconsistency may be related with the 
use of quantities like SAM and OAM, which are not valid operators for transverse Maxwell 
fields and hence cannot be related to meaningful transformations of the fields. 

We are confident that the use of helicity and its associated transformation will become a 
powerful theoretical and practical tool which will improve the understanding and control of 
light-matter interactions at the nano-scale. Applications of the ideas presented in this paper 
can be foreseen in nano-optics, for the control of the shape and polarization of electromag- 
netic fields, in metrology, for probing the equivalence of the electric and magnetic responses 
of a system, and in quantum science and technology, where the identification of two truly 
independently observable degrees of freedom of the field like A and J z should allow to better 
understand the potential benefits of the use of angular momentum in quantum applications. 

The association of helicity and duality with other degrees of freedom and their corre- 
sponding transformations constitutes a general and robust methodology to study practical 
light-matter interaction problems by using fundamental concepts. 
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Appendix A: Helicity eigenstates and transverse electric and transverse magnetic 
modes 

We set out to proof a general relationship between electromagnetic states of well defined 
helicity and the transverse electric and transverse magnetic modes. We start by making 
use of an elegant method for finding solutions of the monochromatic Maxwell equations in 



a source free, isotropic and homogeneous medium [20j, chap. 13.1], [34, chap. VII]. Under 
some suitable conditions of the coordinate system {ui,U2,Us}, two transverse independent 
solutions of the vectorial Helmholtz equation of the medium can be obtained from a separable 
solution i[)(ui,U2, U3) = Ui(ui)U 2 (u2)U 3 (u 3 ) of the corresponding scalar Helmholtz equation. 
With w a unit vector perpendicular to the surface of constant coordinate u\ = C, the two 
vector solutions are obtained as: 

M(r) = V x (wV>) and N(r) = V X M ^ , (Al) 

k 

There are six different coordinate systems for which a complete basis for transverse elec- 



tromagnetic fields can be built in this way 20|, chap. 13.1]. Plane waves, multipoles and 



Bessel beams result from using cartesian, spherical and cylindrical coordinates, respectively 
34j . chap. VII]. Besides being eigenvectors of the Hamiltonian due to its monochromatic 



character, each coordinate system produces electromagnetic modes which are eigenvectors 
of a different set of operators: 

• plane waves with w = z: eigenvectors the of transverse momenta k x and k y . 

• Multipoles with w = -r^: eigenvectors of the squared angular momentum norm J 2 and 
a component of angular momentum, for instance J z . 

• Bessel beams with w = z: eigenvectors of the third components of the linear and 
angular momenta P z and J z . 
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From now on, we will lump the energy H, which is proportional to k, and these other 
degrees of freedom into a collective index v and use M„ and N„. In all three reference 
systems, the M„ and N, modes are commonly referred to as TE and TM modes, respectively. 

Using Vx = S • P |22|, XIII. 93], it can be proved that the helicity operator A can be 

Vx 
k 

chap. 13.1]. Then, together with (lAlj) . we see that: 



written in real space as A = In such formulation, it is also true that M, = AN, 
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N, = AM,, M, = AN,. (A2) 

Namely, the TE and TM modes are transformed into each other by the application of A. 
We can hence obtain electromagnetic fields with well defined helicity as: 

A+ = -L ( M „ + N„) , AA+ = A+ 

V ( A3 ) 

A; = -j= (M, - N,) , AA; = -A-. 

Equations ( 1A3I) already make it obvious that helicity conservation will only happen for 
equivalent scattering of the TE and TM components of the field, which is needed in order 
to preserve their linear combinations. 

It is clear that, since {M,, N,} are an orthonormal basis for transverse fields when all 
values of v are considered, so is {A+, A~}. This derivation of the TE, TM modes is valid 
for the systems of coordinates mentioned above. On the other hand, the helicity operator 
is well defined for any basis. Thus, we think that it is more natural to define the TE and 
TM modes as M, = (A+ + A~) , N„ = (A+ — A~), and as such we use them in the 
main text. 



Appendix B: Derivation of Bessel beams with well defined helicity 

In this appendix we present a constructive derivation of electromagnetic modes with well 
defined energy H, third component of angular and linear momenta J z and P z , and helicity 
A. 

We start by considering TE and TM modes of well defined energy and linear momentum. 



They are plane waves derived following the constructive procedure given in 20j, chap. 13.1] 



34j . chap. VII] which was already used in section [IV CI Their explicit expressions are, in the 
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cartesian [x, y, z] basis: 



ss • exp(ik • r) = — (k y x. — k x y) exp(ik • r) 



p • exp(zk • r) 



-kg (k x x + k y y) + k 2 p z 



exp(ik • r), 



(Bl) 
(B2) 



where k p = ^k% + k^, and [k x , k y , k z ] and k are proportional to the linear momentum vector 
and energy of the plane waves. As per equation (1A3|) . sum and subtraction of TE and TM 
modes result in states of well defined helicity: 



e + exp(zk • r) = —= (s + p) exp(zk ■ r), 
a/2 



e_ exp(ik • r) 
Interestingly, any plane wave of wel 



V2 



(s — p) exp(ik ■ r) 



(B3) 



defined helicity and momentum vector proportional 
to [k x , k y , k z ] can also be obtained [35] by rotating a plane wave of the same helicity and 
momentum [0,0, fc]: 

e + exp(ik • r) = R(0&, <pk) ( ~~ lexp(ikz) ) 

V ) (B4) 

e_ exp(ik • r) = R(0&, 4>k) (r exp(ikz)) . 

arctan^. Equation (1B4|) exploits the 



arcsin and (j) k 



where l=*±f, f = ^, 9 k - , 
fact that helicity does not change under spatial rotations. 

A rotation operation of a vectorial field R(0fc, </>k) (A(r)) is, explicitly: 

A(r)-^fl(^ ) ^)A(ir 1 (^ ) ^r), 

where R(4> k , &k) is the rotation matrix: 



(B5) 



R{(j> k ,e k ) = R z {(j> h )R 



\ 



\ 



J 



cos cos 0fc — sin 0^ sin 9 k cos ^ 
y \Pk) = cos9 k sm(/)k cos(pk sin^ fc sin0 fc , (B6) 
sin 9k cos 

where R z (4> k ) and R y (9 k ) are rotations around the z and y axis respectively. 

We now start our construction of modes with well defined H, P z , J z and A by expressing 
the most general monochromatic forward propagating transverse electromagnetic field as a 
combination of plane waves with well defined momentum: 



A 



sin9 k d9 k 
o Jo 



2tt 



<\[(i,,.(Dk)'R(9k,(/)k) [-lexp(ikz)) + (3(9 k ,(f)k)~R(9 k} (j) k ) (rexp(zfe)) 

(B7) 
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A backward propagating beam can be obtained by using — lexp(— ikz) and f exp(— ikz) as 
the initial plane waves instead. 

Having restricted (1B7j) to a single wavenumber k assures that the resulting mode has a 
definite energy H proportional to k. With respect to A, it is clear from (IB7j) that, if we 
desire a field A with well defined helicity, we need only to set either a(9k,fa) or f3(9k,4>k) 
equal to zero for all (6k, <pk), so that only plane waves of the same helicity type are present 
in (IB7p . With respect to having a well defined P z , and because, in a plane wave, its value is 
proportional to k z = k cos 6k, we must include only a single value of 6k in (1B7j) . 

The conditions needed in (IB 71) to specify a beam with well defined J z are not as apparent 
as in the case of H, P z and A. The solution can be reached by applying the operator J z 
to the general mode A. In order to do that, we use the following definition of J z as an 
infinitesimal rotation operation: 

j, = lim '-ww m 

d<f> k -*0 ld(j)k 

where I is the identity operator. 

Applying (1B8I) to (IB 71) and making use of the properties of rotation operators we obtain: 

r [ii ,. H ■ njn f 2 " ' AA> la A , ^k, fa) ~ K(9 k , fa + dfa) ( f , A , 

J z A = hm / smdkddk dfaa(9 k) fa) tji [-lexp(ikz) ) + 

d4>k^oJ J idfa V J 



^(M t ) R(Mt) '"l yt + " W (fex P (^)) 
id(p k 



(B9) 

which, after integrating by parts can be reduced to: 

J Z [A\ = [ 2 sm6 k de k d^ ^^M ^dkAk) (-lexp(ikz)) + d ^ k ; 0fc) R(9 k , fa) (iexp(ikz)) 
Jo Jo % v<pk V / id(p k 

(BIO) 

So, in this representation, the operator J z acts by taking the partial derivative of the coor- 
dinates (a(9k,(f>k), (3(9k,4>k)) with respect to fa and dividing by i. 
Gathering together all the above, we can assert that 

a(9 k , fa) = ^5(9 k - 9' k ) exp (infa), 0(9 k , fa) = 0, (Bll) 

specifies a mode with well defined energy H proportional to k, well defined A with value 
A = 1, well defined J z with value n and well defined P z proportional to k z = kcos(9' k ), and 
that 

a(9 k , fa) = 0, 0(9 k , fa) = ^-5(9 k - 9' k ) exp (infa), (B12) 

Z7T 
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does the same for the opposite helicity. 

After inserting the specified coordinates into equation (1B7|) . substituting the rotated plane 
wave by their explicit expressions as linear combinations of (IBip and (IB2I) . changing basis 
from [x, y, z] to [f , 1, z] and using exp(ik • r) = ^ m i m J m (/c p p) exp(zm(0 — <pk)) exp(ik z z) 
before performing the integral in d(pk, we finally obtain the real space expressions of the 
modes in cylindrical coordinates that have been extensively used in the main text: 

fk / i ( k k A k \ 

C nkp (p,(f>,z) = y -^-i n exp(i(k z z + n<f>)) ^— ^(1 + -^)(J n+ i(fc p p) cxp(i</>)f + (1 - J n -i (k p p) cxp(-i<f>)\J - -^J n (k p p)iJ 

fk / i ( k z k z -\ k 

£>nk p (fi, <M) = Y -^i™ exp(i(k z z + n<j>)) ^— ^(1 - — )( J n +i (k p p) cxp(«0)f + (1 + -^) J„_i (k p p) cxp(-i<£)lj + -^J n (k p p)i 

(B13) 

Note that, due to the integration limits of 9k in ( \B7\i , the derivation is restricted to 
propagating modes. In reality, values of k p > k in equations f)B13j) are possible, specifying 



non-propagating modes with well defined H, P z , J z and A. Including non-propagating 
modes from the start of the derivation can be done by using imaginary values of 9k in order 
to obtain values of sin(6 l ) bigger than one. 

Appendix C: Preservation of J z and A by the action of an aplanatic lens 

The aplanatic or spherical lens model allows to relate the real space field profile of the 
collimated input beam to the angular spectrum of the focused beam. Originally developed 



by Richards and Wolf in 
in 
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331 ] . we can find an explanation of the model and its formulas 



chap. 3.5], which we reproduce here using a slightly different notation using that 



exp(ikz cos9k) exp(ikpsm9k cos(0 — 4>k)) = exp(ik • r): 



E out (p, 4>, z ) 



ikfexp(-ikf) f e k L / - 



2- 

E 



/ / sm6 k d8 k d<f> k E oo (flfe,0it)exp(ik • r) (CI) 
Jo Jo 



(9 k ,4> k ) = [t s (e k )(S 0>4>k ■■E inc {f S m{e k ),^ h ))se kAh +tP(e k ) (p o , 0fc ■ E mc (/sin(0 fc ),<fe)) Pe h ,^ h ] (cos0 fc ) 1/2 



(C2) 



Where: 



E ou t(p, 0, z) is the focused field in real space expressed in cylindrical coordinates 
[p,0, z]. 
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• / is the focal distance of the lens, 9™ = arcsin(NA), where NA is the numerical 
aperture of the lens, and k the wavenumber of the field. 

• [p, 0, z = zq], with p = /sin(0fc) and = <pk are both the real space cylindrical 
coordinate system for the input beam Ej nc (p, 0, z = zq) and the spherical coordinates 
in momentum space of 'E 00 (9k, 0fc), the angular spectrum of the focalized output beam, 
with 9k = arcsin(^), 0^ = arctan(| £ ). This dual role of the coordinates is the essence 
of the model. 

• {t s {9k),t p (9k)) are the lenses TE and TM transmission coefficients and (n 1 ,n 2 ) the 
indexes of refraction of the input and output media. 

We emphasize that the model is valid for a collimated input only. 

The definitions of the polarization vectors s a ^ and p a ^ are those of (JEU) and with 
a = arccos(-y) and ip = arctan(^). 

We now analyze the properties of the model with respect to conservation of J z and A. 

We start by using some of the ideas and techniques of appendix [B] in order to calculate the 

angular momentum of the focused beam when the input beam has a definite value of J z . 
Using ( 1B7[) . expression ( 1C1I) can also be written as: 



E out (p,0,z)= / sin(9 k )de k / d<f> [g s (B k , (j> k )Tl(6 k , <f> k ) (-if exp(ik ■ r)) + g p (9 k , <j> k )R(6 k , 4> k ) (-x exp(ik ■ r))] , (C3) 



As seen in appendix [Bj in order to apply the operator J z to the output focused beam in 
( 1C3I) . we need to take the partial derivative of its coordinates (g s (0k, 0fc), 9p(9k, 4>k)) with 
respect to 0^ and divide by i. Equation flC4j) relates (g s (9 k , 0fc), g p (9 k , 4>k)) to the input 
beam. Note that the effect of the lens, which is given by the transmittivities t s (9k),t p (9k), 
does not add any azimuthal dependence to g s (9k, 0fc) and g p (9k, <pk)- Thus, with this model, 
the lens will not affect the azimuthal dependence of the input beam, hence keeping the 
angular momentum constant. More explicitely, since vectors So,0* , Vo,<j> k are proportional to 





where 



g s {0kAk) = t s (9 k ) (sq,^ ■ E inc (fsxa(9 k ),(f) k )) 
g P (9k,<fik) = t p (9 k ) (p 0A ■ E inc (/sin(6 l fc ),0 A; )) . 



(C4) 



29 



the azimuthal and radial polarization vectors defined as: 



' - sin fe ^ 



COS0 fc 





Ax)S0/A 



sin (p k 




(C5) 



the focalized field angular spectrum coordinates at point (9k,4>k), (9s(9k, 4>k), 9 P {9k, 0fc))> 
are the real space azimuthal Ef nc and radial E? nc components of the input field at point 
p = f sin(9 k ),<f) — 4>k- In order to compute the angular momentum of the focused field we 
need to take the partial derivatives of the radial and azimuthal components of the input 
field with respect to <p k = <p and divide by i. Let us now find explicit expressions for Ef nc 
and E p inc . 

The collimated input beam can be expanded in the plane z = z into modes of the 
type ( ITT)) and (ITS)) with — > 0, which we rewrite here after the change of basis f = 
75 (p ~ i$) exp(-z0), 1 = -j-(p + i$) exp(i0): 



C nkp (p,(f),z = z ) ~ \/^p n+ exp(ik z z )J n+1 (k p p)exp(i(j)n)-j=(p + i(j)), 

~k~ 1 

D nkp (p,(f),z = Zq) w \l— i n+1 exp(ik z z )Jn-i(kpp)exp(i4>n)— =(p - i(p). 



(C6) 

Note that in order to avoid confusion with 9k, the letter for the input beam coordinate 
angle arctan(y, x) is now 0, instead of the original letter 9 in expressions (ITT)) and ( ITS)) . 

The sum and subtraction of the above modes result in approximately pure radially and 
azimuthally polarized modes, which separates the radial and azimuthal components of the 
field: 



2k 



1 



A n fc p (p, 0, z = z ) « W i n+ exp(zfc z z ) [J n +i{k p p) + J n+1 {k p p)} exp(i<jm)— =p = A p nk (p, <p, z )p 



7T 



V2 f 



'2k 1 - i 

B nkp (p,(f),z = z ) « -\/— ^i n+2 exp(iA; 2 2 ) [J„ +1 (A; p p) - J n+1 (fc p p)] exp(i0n)-^=0 = B^ kp (p,(j), z 



(C7) 



which means that, ()C4j) is explicitly 



g s {9 k Ak) = t s {9 k )Ei nc = t s (9k)Bt kp (fsm(9k)Ak,z ), 
g P (9 k Ak) = t p (9k)Ef nc = t p (9k)Ai ko (fsm(9k),<f> k ,z ). 



(C8) 
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Then, we need to take the partial derivative of the azimuthal and radial input field 
components B^ k (f sin(0fc), <frk, z = z ) and A^ kp (f sin(0fc), 0^, z = Zq) with respect to 4>k and 
divide by i. Let us now assume that the input is a field with definite angular momentum 
J z = n. Inspection of flCJTj) reveals that such operation will leave those components invariant 
except for a multiplicative factor equal to n. So, the output field is also a field with definite 
angular momentum equal to n. We have just proved that the aplanatic lens model transfers 
the J z value of the input beam to the output beam without changing it. 

We now study the behavior of the model with respect to helicity. From (1C4I) . it becomes 
clear that the TE and TM content of the output field are independently determined by the 
real azimuthal and radial components (1C7I) of the input field respectively. Having taken 
linear combinations of modes with defined helicity, and using the results in appendix |Aj it 
follows that the modes in (10T|) are also pure TE and TM modes respectively. Using the 
results from section HV CI about helicity preservation in the TE TM basis, we now see that, 
since there is no cross-talk between the TE and TM components of the input and focalized 
fields, the key condition for helicity preservation in the aplanatic lens model is: 



ts(0 k )=t p (9 k )V9 k . (C9) 



In real manufacturing of microscope objectives, this condition is related to the coating of 



the lens surfaces 36j, chap. 3.6], that is, a property of the materials and not the geometry 
of the system. 

Let us assume for a moment a more general dependence of the transmission coefficients 
ts{^ki4 > k)^p{^k-i ( Pk)- The new (f) k dependence could destroy the J z preservation since now 
more terms will be involved in the partial derivatives of g s {0k,4>k) and g p (6k,4>k)- On the 
other hand, as long as g s (0 k , 4>k) — 9p(9k, 4>k) for all (9 k , 4>k)i helicity will be preserved. This 
is another example of the independence of the conservation laws of J z and A. 
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